Structuring of Baseball Live Games Based on Speech Recognition Using Task Dependent Knowledge
نویسندگان
چکیده
It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we propose in this paper a speech recognition method of incorporating the baseball game knowledge such as counting of inning, out, strike and ball. Due to this taskdependent knowledge, the proposed method can effectively prevent speech recognition errors. This method is formalized in the framework of probability theory and implemented in the conventional speech decoding (Viterbi) algorithm. The experimental results showed that the proposed approach improved the structuring situation segmentation accuracy as well as keywords accuracy.
منابع مشابه
Situation based speech recognition for structuring baseball live games
It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy, emotional and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we have been studied the speech recognition method incorporating the baseball game task-dependent knowledge as well as an announcer’s emotion i...
متن کاملStructuring of baseball live games based on speech recognition using task dependant knowledge
It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we propose in this paper a speech recognition method of incorporating the baseball game knowledge such as counting of inning, out, strike and ball. Due to...
متن کاملReal-Time Closed-Captioning Using Speech Recognition
There is a great need for more TV programs to be closed-captioned to help hearing impaired and elderly people watch TV. For that purpose, automatic speech recognition is expected to contribute to providing text from speech in real-time. NHK has been using speech recognition for closed-captioning of some of its news, sports and other live TV programs. In news programs, automatic speech recogniti...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملLive speech recognition in sports games by adaptation of acoustic model and language model
This paper proposes a method to automatically extract keywords from baseball radio speech through LVCSR for highlight scene retrieval. For robust recognition, we employed acoustic and language model adaptation. In acoustic model adaptation, supervised and unsupervised adaptations were carried out using MLLR+MAP. By this two level adaptation, word accuracy was improved by 28%. In language model ...
متن کامل